首页> 外文OA文献 >A MapReduce-based nearest neighbor approach for big-data-driven traffic flow prediction
【2h】

A MapReduce-based nearest neighbor approach for big-data-driven traffic flow prediction

机译:基于MapReduce的最近邻方法用于大数据驱动的交通流预测

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In big-data-driven traffic flow prediction systems, the robustness of prediction performance depends on accuracy and timeliness. This paper presents a new MapReduce-based nearest neighbor (NN) approach for traffic flow prediction using correlation analysis (TFPC) on a Hadoop platform. In particular, we develop a real-time prediction system including two key modules, i.e., offline distributed training (ODT) and online parallel prediction (OPP). Moreover, we build a parallel k-nearest neighbor optimization classifier, which incorporates correlation information among traffic flows into the classification process. Finally, we propose a novel prediction calculation method, combining the current data observed in OPP and the classification results obtained from large-scale historical data in ODT, to generate traffic flow prediction in real time. The empirical study on real-world traffic flow big data using the leave-one-out cross validation method shows that TFPC significantly outperforms four state-of-the-art prediction approaches, i.e., autoregressive integrated moving average, Naïve Bayes, multilayer perceptron neural networks, and NN regression, in terms of accuracy, which can be improved 90.07% in the best case, with an average mean absolute percent error of 5.53%. In addition, it displays excellent speedup, scaleup, and sizeup.
机译:在大数据驱动的交通流量预测系统中,预测性能的鲁棒性取决于准确性和及时性。本文提出了一种新的基于MapReduce的最近邻(NN)方法,用于在Hadoop平台上使用相关分析(TFPC)进行交通流预测。特别是,我们开发了包括两个关键模块的实时预测系统,即离线分布式训练(ODT)和在线并行预测(OPP)。此外,我们构建了并行的k最近邻优化分类器,该分类器将流量之间的相关性信息纳入分类过程。最后,我们提出了一种新颖的预测计算方法,将在OPP中观察到的当前数据与从ODT中的大规模历史数据获得的分类结果相结合,以实时生成交通流量预测。使用留一法交叉验证方法对现实世界交通流量大数据进行的实证研究表明,TFPC明显优于四种最新的预测方法,即自回归积分移动平均,朴素贝叶斯,多层感知器神经网络和NN回归的准确性方面,在最佳情况下可以提高90.07%,平均绝对绝对误差为5.53%。此外,它还具有出色的加速,放大和放大效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号